NSF PAR Search | NSF Public Access Repository


Learning Transferable Features for Implicit Neural Representations

Vyas, Kushal; Humayun, Ahmed I; Dashpute, Aniket; Baraniuk, Richard B; Veeraraghavan, Ashok; Balakrishnan, Guha (September 2024, ArXiv)

Implicit neural representations (INRs) have demonstrated success in a variety of applications, including inverse problems and neural rendering. An INR is typically trained to capture one signal of interest, resulting in learned neural features that are highly attuned to that signal. Such features are assumed to be less generalizable; we explore their transferability for fitting similar signals. We introduce a new INR training framework, STRAINER, that learns transferable features for fitting INRs to new signals from a given distribution, faster and with better reconstruction quality. Owing to the sequential layer-wise affine operations in an INR, we propose to learn transferable representations by sharing initial encoder layers across multiple INRs with independent decoder layers. At test time, the learned encoder representations are transferred as initialization for an otherwise randomly initialized INR. We find that STRAINER yields an extremely powerful initialization for fitting images from the same domain and allows for a gain of approximately 10 dB in signal quality early on compared to an untrained INR. STRAINER also provides a simple way to encode data-driven priors in INRs. We evaluate STRAINER on multiple in-domain and out-of-domain signal fitting tasks and inverse problems, and further provide detailed analysis and discussion of the transferability of STRAINER's features.
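The parameter layout described in this abstract (shared encoder layers, one independent decoder per training signal, and an encoder transfer at test time) can be sketched in plain Python. This is a minimal structural sketch, not the authors' implementation: the class and function names, the layer sizes, the 2-D-coordinate-to-RGB shapes, and the sine activation are all assumptions made for illustration, and the training loop is omitted.

```python
import copy
import math
import random


def make_layer(n_in, n_out, rng):
    """Return a dense layer as (weights, biases) with small random weights."""
    weights = [[rng.uniform(-1.0, 1.0) / n_in for _ in range(n_in)]
               for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases


def apply_layer(layer, x, activation=None):
    """Apply the affine map W x + b, then an optional elementwise activation."""
    weights, biases = layer
    out = [sum(w * v for w, v in zip(row, x)) + b
           for row, b in zip(weights, biases)]
    return [activation(v) for v in out] if activation else out


class SharedEncoderINRs:
    """Hypothetical training-time layout: several INRs, one shared encoder."""

    def __init__(self, n_signals, n_in=2, hidden=8, n_out=3, seed=0):
        rng = random.Random(seed)
        # Encoder layers are shared by every signal being fitted.
        self.encoder = [make_layer(n_in, hidden, rng),
                        make_layer(hidden, hidden, rng)]
        # Each signal owns an independent decoder layer.
        self.decoders = [make_layer(hidden, n_out, rng)
                         for _ in range(n_signals)]

    def forward(self, coords, signal_index):
        h = coords
        for layer in self.encoder:
            h = apply_layer(layer, h, math.sin)  # sine activation is an assumption
        return apply_layer(self.decoders[signal_index], h)


def init_test_time_inr(trained, n_out=3, seed=1):
    """Copy the learned encoder; start the decoder from random weights."""
    rng = random.Random(seed)
    encoder = copy.deepcopy(trained.encoder)
    hidden = len(encoder[-1][1])  # width of the last encoder layer
    decoder = make_layer(hidden, n_out, rng)
    return encoder, decoder


model = SharedEncoderINRs(n_signals=4)
rgb = model.forward([0.25, -0.5], signal_index=0)
new_encoder, new_decoder = init_test_time_inr(model)
```

The encoder is deep-copied so that fitting the new signal would not alter the shared weights; whether the original method copies or fine-tunes in place is not stated in the abstract.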
Full Text Available
ElasticDiffusion: Training-Free Arbitrary Size Image Generation Through Global-Local Content Separation

https://doi.org/10.1109/CVPR52733.2024.00631

Haji-Ali, Moayed; Balakrishnan, Guha; Ordonez, Vicente (June 2024, IEEE Conference on Computer Vision and Pattern Recognition (CVPR))

Full Text Available
MadEye: Boosting Live Video Analytics Accuracy with Adaptive Camera Configurations

Wong, Mike; Ramanujam, Murali; Balakrishnan, Guha; Netravali, Ravi (April 2024, 21st USENIX Symposium on Networked Systems Design and Implementation (NSDI '24))

Full Text Available
Improving Denoising Diffusion Probabilistic Models via Exploiting Shared Representations

https://doi.org/10.1109/IEEECONF59524.2023.10476867

Pirhayatifard, Delaram; Toghani, Mohammad Taha; Balakrishnan, Guha; Uribe, César A (October 2023, IEEE)

Full Text Available
SplineCam: Exact Visualization and Characterization of Deep Network Geometry and Decision Boundaries

https://doi.org/10.1109/CVPR52729.2023.00369

Humayun, Ahmed Imtiaz; Balestriero, Randall; Balakrishnan, Guha; Baraniuk, Richard (June 2023, 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR))

Current Deep Network (DN) visualization and interpretability methods rely heavily on data space visualizations such as scoring which dimensions of the data are responsible for their associated prediction or generating new data features or samples that best match a given DN unit or representation. In this paper, we go one step further by developing the first provably exact method for computing the geometry of a DN's mapping, including its decision boundary, over a specified region of the data space. By leveraging the theory of Continuous Piecewise Linear (CPWL) spline DNs, SplineCam exactly computes a DN's geometry without resorting to approximations such as sampling or architecture simplification. SplineCam applies to any DN architecture based on CPWL activation nonlinearities, including (leaky) ReLU, absolute value, maxout, and max-pooling and can also be applied to regression DNs such as implicit neural representations. Beyond decision boundary visualization and characterization, SplineCam enables one to compare architectures, measure generalizability, and sample from the decision boundary on or off the data manifold. Project website: bit.ly/splinecam.
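The CPWL property that this abstract builds on can be checked on a toy network: inside a region where the ReLU on/off pattern is fixed, the network equals a single affine map. The sketch below illustrates only that property, not SplineCam's region-enumeration method; the weights and function names are made up for the example.

```python
# Toy ReLU network f(x) = W2 * relu(W1 * x + B1) + B2 with made-up weights.
W1 = [[1.0, -1.0], [0.5, 2.0], [-1.0, 0.5]]
B1 = [0.0, -1.0, 0.5]
W2 = [[1.0, -2.0, 0.5]]
B2 = [0.25]


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(a * b for a, b in zip(row, vector)) for row in matrix]


def preactivation(x):
    """Hidden-layer values before the ReLU."""
    return [p + b for p, b in zip(matvec(W1, x), B1)]


def network(x):
    """Evaluate the ReLU network directly."""
    hidden = [max(0.0, p) for p in preactivation(x)]
    return [o + b for o, b in zip(matvec(W2, hidden), B2)]


def activation_pattern(x):
    """1.0 where a hidden unit is active at x, 0.0 where it is off."""
    return tuple(1.0 if p > 0 else 0.0 for p in preactivation(x))


def local_affine_map(x):
    """Return (A, c) such that f(y) = A y + c for all y sharing x's pattern."""
    q = activation_pattern(x)
    # Zero out the rows and biases of hidden units that are off at x.
    masked_w1 = [[qj * w for w in row] for qj, row in zip(q, W1)]
    masked_b1 = [qj * b for qj, b in zip(q, B1)]
    A = [[sum(W2[i][j] * masked_w1[j][k] for j in range(len(q)))
          for k in range(len(x))]
         for i in range(len(W2))]
    c = [o + b for o, b in zip(matvec(W2, masked_b1), B2)]
    return A, c


x = [1.0, 0.5]
y = [1.1, 0.45]  # a nearby point with the same activation pattern as x
A, c = local_affine_map(x)
affine_at_y = [v + b for v, b in zip(matvec(A, y), c)]
```

Here `affine_at_y` agrees with `network(y)` up to floating-point rounding because `x` and `y` lie in the same linear region; across a region boundary the pattern, and hence `(A, c)`, would change.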
Full Text Available
MINER: Multiscale Implicit Neural Representation

Saragadam, Vishwanath; Tan, Jasper; Balakrishnan, Guha; Baraniuk, Richard G.; Veeraraghavan, Ashok (October 2022, European Conference on Computer Vision (ECCV) 2022)

We introduce a new neural signal model designed for efficient high-resolution representation of large-scale signals. The key innovation in our multiscale implicit neural representation (MINER) is an internal representation via a Laplacian pyramid, which provides a sparse multiscale decomposition of the signal that captures orthogonal parts of the signal across scales. We leverage the advantages of the Laplacian pyramid by representing small disjoint patches of the pyramid at each scale with a small MLP. This enables the capacity of the network to adaptively increase from coarse to fine scales, and only represent parts of the signal with strong signal energy. The parameters of each MLP are optimized from coarse to fine scale, which yields faster approximations at coarser scales and, ultimately, an extremely fast training process. We apply MINER to a range of large-scale signal representation tasks, including gigapixel images and very large point clouds, and demonstrate that it requires fewer than 25% of the parameters, 33% of the memory footprint, and 10% of the computation time of competing techniques such as ACORN to reach the same representation accuracy.
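The Laplacian-pyramid idea in this abstract can be illustrated on a 1-D signal: each level stores only the residual that the next coarser level cannot predict, so smooth regions produce zero residual and need no model capacity. The sketch below is an assumption-laden toy, not MINER itself: it uses pair averaging instead of a Gaussian filter, requires the signal length to be divisible by 2 per level, and the `active_patches` helper and its threshold are hypothetical stand-ins for deciding which patches would receive an MLP.

```python
def downsample(signal):
    """Halve the length by averaging adjacent pairs (box-filter stand-in)."""
    return [(signal[i] + signal[i + 1]) / 2.0 for i in range(0, len(signal), 2)]


def upsample(signal):
    """Double the length by repeating each sample."""
    return [v for v in signal for _ in range(2)]


def build_pyramid(signal, levels):
    """Return (coarsest signal, residuals ordered fine to coarse)."""
    residuals = []
    current = list(signal)
    for _ in range(levels):
        coarse = downsample(current)
        predicted = upsample(coarse)
        # The residual holds only what the coarser level fails to predict.
        residuals.append([c - p for c, p in zip(current, predicted)])
        current = coarse
    return current, residuals


def reconstruct(coarsest, residuals):
    """Invert build_pyramid by adding residuals back, coarse to fine."""
    current = list(coarsest)
    for residual in reversed(residuals):
        current = [p + r for p, r in zip(upsample(current), residual)]
    return current


def active_patches(residual, patch_size, threshold):
    """Indices of disjoint patches whose residual energy exceeds threshold."""
    active = []
    for start in range(0, len(residual), patch_size):
        energy = sum(v * v for v in residual[start:start + patch_size])
        if energy > threshold:
            active.append(start // patch_size)
    return active


signal = [1.0, 1.0, 1.0, 1.0, 5.0, 7.0, 2.0, 2.0]
coarsest, residuals = build_pyramid(signal, levels=2)
restored = reconstruct(coarsest, residuals)
fine_active = active_patches(residuals[0], patch_size=2, threshold=0.5)
```

In this toy signal only one of the four finest-scale patches carries residual energy, which mirrors the abstract's point that capacity is spent only where the signal has strong energy at a given scale.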
Full Text Available
Unsupervised learning of probabilistic diffeomorphic registration for images and surfaces

https://doi.org/10.1016/j.media.2019.07.006

Dalca, Adrian V.; Balakrishnan, Guha; Guttag, John; Sabuncu, Mert R. (October 2019, Medical Image Analysis)

Full Text Available
